Weighted entropy training for the decision tree based text-to-phoneme mapping
Abstract
The pronunciation model that maps the written form of words to their pronunciations is called the text-to-phoneme (TTP) mapping. Such a mapping is commonly used in automatic speech recognition (ASR) as well as in text-to-speech (TTS) applications. Rule-based TTP mappings can be derived for structured languages, such as Finnish and Japanese. Data-driven TTP mappings are usually applied to non-structured languages, such as English and Danish. Artificial neural network (ANN) and decision tree (DT) approaches are commonly applied in this task. Compared to the ANN methods, the DT methods usually provide more accurate pronunciation models. The DT methods can, however, lead to a set of models with a high memory footprint if the mappings between letters and phonemes are complex. In this paper, we present a weighted entropy training method for the DT-based TTP mapping. Statistical information about the vocabulary is utilized in the training process in order to optimize the TTP performance for pre-defined memory requirements. The results obtained in the simulation experiments indicate that the memory requirements of the TTP models can be significantly reduced without degrading the mapping accuracy. The applicability of the approach is also verified in speech recognition experiments.
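The abstract does not give the exact weighting scheme, so the following is only a minimal sketch of one plausible reading: each training sample (a letter context and its phoneme) carries a weight, for example derived from how often its word occurs in the vocabulary, and the entropy used to choose decision tree splits is computed over weighted counts rather than raw counts. The function names, the sample layout, and the weights are all hypothetical and are not taken from the paper.

```python
import math
from collections import defaultdict


def weighted_entropy(samples):
    """Entropy (in bits) of the phoneme distribution, counting each sample by its weight.

    samples: list of (context, phoneme, weight) tuples, where context is a dict
    of attribute name -> letter. This layout is an assumption for illustration.
    """
    totals = defaultdict(float)
    for _context, phoneme, weight in samples:
        totals[phoneme] += weight
    mass = sum(totals.values())
    if mass == 0:
        return 0.0
    return -sum((w / mass) * math.log2(w / mass) for w in totals.values() if w > 0)


def weighted_information_gain(samples, attribute):
    """Reduction in weighted entropy obtained by splitting on one context attribute."""
    mass = sum(weight for _context, _phoneme, weight in samples)
    if mass == 0:
        return 0.0
    groups = defaultdict(list)
    for sample in samples:
        groups[sample[0][attribute]].append(sample)
    remainder = sum(
        (sum(weight for _c, _p, weight in group) / mass) * weighted_entropy(group)
        for group in groups.values()
    )
    return weighted_entropy(samples) - remainder


# Hypothetical samples for the letter "c": the context holds the neighbouring
# letters, and the weight stands in for a vocabulary statistic such as word frequency.
samples = [
    ({"prev": "_", "next": "a"}, "k", 3.0),
    ({"prev": "_", "next": "e"}, "s", 1.0),
]

for attribute in ("prev", "next"):
    print(attribute, weighted_information_gain(samples, attribute))
```

In this toy data the "next" attribute separates the two phonemes completely while "prev" does not separate them at all, so a tree grown with this criterion would split on "next". Raising the weight of frequent words makes the tree spend its limited nodes on the contexts that matter most for the vocabulary, which is in the spirit of trading accuracy against a memory budget.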
Similar resources
Decision tree based text-to-phoneme mapping for speech recognition
In many embedded speech recognition systems, the phonetic transcriptions of the vocabulary items, i.e., the lexicons, cannot be stored to the device beforehand. A text-to-phoneme mapping functionality is hence needed to create the transcriptions from plain text. Several approaches have been evaluated in the literature. In this paper, a decision tree based text-to-phoneme mapping is studied. A d...
Application of Different Methods of Decision Tree Algorithm for Mapping Rangeland Using Satellite Imagery (Case Study: Doviraj Catchment in Ilam Province)
Using satellite imagery for the study of Earth's resources is attended by many researchers. In fact, the various phenomena have different spectral response in electromagnetic radiation. One major application of satellite data is the classification of land cover. In recent years, a number of classification algorithms have been developed for classification of remote sensing data. One of the most nota...
A novel voice conversion system based on codebook mapping with phoneme-tied weighting
This paper presents a novel voice conversion system based on codebook mapping. A new phoneme-tied weighting strategy is proposed to reduce the smoothing effects in weighted sum of code books, while a new prosodic conversion method by decision tree is proposed to cope with the complex prosody of Chinese. STRAIGHT algorithm is used to decompose spectrum and excitation for separate modification. L...
Extended MULTIMOORA method based on Shannon entropy weight for materials selection
Selection of appropriate material is a crucial step in engineering design and manufacturing process. Without a systematic technique, many useful engineering materials may be ignored for selection. The category of multiple attribute decision-making (MADM) methods is an effective set of structured techniques. Having uncomplicated assumptions and mathematics, the MULTIMOORA method as an MADM appro...
INFORMATION MEASURES BASED TOPSIS METHOD FOR MULTICRITERIA DECISION MAKING PROBLEM IN INTUITIONISTIC FUZZY ENVIRONMENT
In the fuzzy set theory, information measures play a paramount role in several areas such as decision making, pattern recognition etc. In this paper, similarity measure based on cosine function and entropy measures based on logarithmic function for IFSs are proposed. Comparisons of proposed similarity and entropy measures with the existing ones are listed. Numerical results limpidly betoken th...
Publication date: 2003